Dynamic programming for ergodic control with partial observations

نویسنده

  • V. S. Borkar
چکیده

A dynamic programming principle is derived for a discrete time Markov control process taking values in a .nite dimensional space, with ergodic cost and partial observations. This uses the embedding of the process into another for which an accessible atom exists and hence a coupling argument can be used. In turn, this is used for deriving a martingale dynamic programming principle for ergodic control of partially observed di3usion processes, by ‘lifting’ appropriate estimates from a discrete time problem associated with it to the continuous time problem. c © 2002 Elsevier Science B.V. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A partial history of the early development of continuous-time nonlinear stochastic systems theory

This article is a survey of the early development of selected areas in nonlinear continuous-time stochastic control. Key developments in optimal control and the dynamic programming principle, existence of optimal controls under complete and partial observations, nonlinear filtering, stochastic stability, the stochastic maximum principle and ergodic control are discussed. Issues concerning wide ...

متن کامل

Singular Ergodic Control for Multidimensional Gaussian-Poisson Processes

Singular control for multidimensional Gaussian-Poisson processes with a long-run (or ergodic) and a discounted criteria are discussed. The dynamic programming yields the corresponding Hamilton-JacobiBellman equations, which are discussed. Full details on the proofs and further extensions are left for coming works.

متن کامل

Stochastic Dynamic Programming with Markov Chains for Optimal Sustainable Control of the Forest Sector with Continuous Cover Forestry

We present a stochastic dynamic programming approach with Markov chains for optimal control of the forest sector. The forest is managed via continuous cover forestry and the complete system is sustainable. Forest industry production, logistic solutions and harvest levels are optimized based on the sequentially revealed states of the markets. Adaptive full system optimization is necessary for co...

متن کامل

Extracting Dynamics Matrix of Alignment Process for a Gimbaled Inertial Navigation System Using Heuristic Dynamic Programming Method

In this paper, with the aim of estimating internal dynamics matrix of a gimbaled Inertial Navigation system (as a discrete Linear system), the discretetime Hamilton-Jacobi-Bellman (HJB) equation for optimal control has been extracted. Heuristic Dynamic Programming algorithm (HDP) for solving equation has been presented and then a neural network approximation for cost function and control input ...

متن کامل

A New Approach to Distribution Fitting: Decision on Beliefs

We introduce a new approach to distribution fitting, called Decision on Beliefs (DOB). The objective is to identify the probability distribution function (PDF) of a random variable X with the greatest possible confidence. It is known that f X is a member of = { , , }. 1 m S f L f To reach this goal and select X f from this set, we utilize stochastic dynamic programming and formulate this proble...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002